DS-Net: Dynamic spatiotemporal network for video salient object detection
نویسندگان
چکیده
As moving objects always draw more attention of human eyes, the temporal motion information is exploited complementarily with spatial to detect salient in videos. Although efficient tools such as optical flow have been proposed extract information, it often encounters difficulties when used for saliency detection due movement camera or partial objects. In this paper, we investigate complementary roles and propose a novel dynamic spatiotemporal network (DS-Net) effective fusion information. We construct symmetric two-bypass explicitly features. A weight generator (DWG) designed automatically learn reliability corresponding branch. And top-down cross attentive aggregation (CAA) procedure facilitate Finally, features are modified by guidance coarse map then go through decoder part final map. Experimental results on five benchmarks VOS, DAVIS, FBMS, SegV2, ViSal demonstrate that method achieves superior performance than state-of-the-art algorithms. The source code available at https://github.com/TJUMMG/DS-Net.
منابع مشابه
Video Salient Object Detection Using Spatiotemporal Deep Features
This paper presents a method for detecting salient objects in videos where temporal information in addition to spatial information is fully taken into account. Following recent reports on the advantage of deep features over conventional handcrafted features, we propose the SpatioTemporal Deep (STD) feature that utilizes local and global contexts over frames. We also propose the SpatioTemporal C...
متن کاملSalient Region Detection in Video Using Spatiotemporal Visual Attention Model
Abstract Salient region detection is very useful in video analysis. A salient region detection method based on spatiotemporal visual attention model is proposed in this paper. Visual attention mechanism is used to generate saliency map of the image sequence. Spatial saliency map is computed in accordance with some predefined features including intensity, color and orientation. Temporal visual s...
متن کاملMSDNN: Multi-Scale Deep Neural Network for Salient Object Detection
Salient object detection is a fundamental problem and has been received a great deal of attentions in computer vision. Recently deep learning model became a powerful tool for image feature extraction. In this paper, we propose a multi-scale deep neural network (MSDNN) for salient object detection. The proposed model first extracts global high-level features and context information over the whol...
متن کاملEfficient Co-Salient Video Object Detection Based on Preattentive Processing
Automatic video annotation is a critical step for contentbased video retrieval and browsing. Detecting the focus of interest such as co-occurring objects in video frames automatically can benefit the tedious manual labeling process. However, detecting the co-occurring objects that is visually salient in video sequences is a challenging task. In this paper, in order to detect co-salient video ob...
متن کاملImpression Network for Video Object Detection
Video object detection is more challenging compared to image object detection. Previous works proved that applying object detector frame by frame is not only slow but also inaccurate. Visual clues get weakened by defocus and motion blur, causing failure on corresponding frames. Multiframe feature fusion methods proved effective in improving the accuracy, but they dramatically sacrifice the spee...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Digital Signal Processing
سال: 2022
ISSN: ['1051-2004', '1095-4333']
DOI: https://doi.org/10.1016/j.dsp.2022.103700